HeterPS: Distributed deep learning with reinforcement learning based scheduling in heterogeneous environments

نویسندگان

چکیده

Deep neural networks (DNNs) exploit many layers and a large number of parameters to achieve excellent performance. The training process DNN models generally handles large-scale input data with sparse features, which incurs high Input/Output (IO) cost, while some are compute-intensive. exploits distributed computing resources reduce time. While heterogeneous resources, e.g., CPUs, GPUs multiple types, available for the process, scheduling diverse remains critical process. To efficiently train model using we propose framework, i.e., Heterogeneous Parameter Server (HeterPS), composed architecture Reinforcement Learning (RL)-based method. advantages HeterPS three-fold compared existing frameworks. First, enables efficient workloads resources. Second, an RL-based method schedule workload each layer appropriate minimize cost satisfying throughput constraints. Third, manages storage communication among We carry out extensive experiments show that significantly outperforms state-of-the-art approaches in terms (14.5 times higher) monetary (312.3% smaller).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

Learning to Design Games: Strategic Environments in Deep Reinforcement Learning

In typical reinforcement learning (RL), the environment is assumed given and the goal of the learning is to identify an optimal policy for the agent taking actions through its interactions with the environment. In this paper, we extend this setting by considering the environment is not given, but controllable and learnable through its interaction with the agent at the same time. Theoretically, ...

متن کامل

Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning

This paper introduces Dex, a reinforcement learning environment toolkit specialized for training and evaluation of continual learning methods as well as general reinforcement learning problems. We also present the novel continual learning method of incremental learning, where a challenging environment is solved using optimal weight initialization learned from first solving a similar easier envi...

متن کامل

Reinforcement Learning with Deep Energy-Based Policies

We propose a method for learning expressive energy-based policies for continuous states and actions, which has been feasible only in tabular domains before. We apply our method to learning maximum entropy policies, resulting into a new algorithm, called soft Q-learning, that expresses the optimal policy via a Boltzmann distribution. We use the recently proposed amortized Stein variational gradi...

متن کامل

Vision-based Deep Reinforcement Learning

Recently, Google Deepmind showcased how Deep learning can be used in conjunction with existing Reinforcement Learning (RL) techniques to play Atari games[11], beat a world-class player [14] in the game of Go and solve complicated riddles [3]. Deep learning has been shown to be successful in extracting useful, nonlinear features from high-dimensional media such as images, text, video and audio [...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Future Generation Computer Systems

سال: 2023

ISSN: ['0167-739X', '1872-7115']

DOI: https://doi.org/10.1016/j.future.2023.05.032